Classifying gene expression profiles from pairwise mRNA comparisons.
نویسندگان
چکیده
We present a new approach to molecular classification based on mRNA comparisons. Our method, referred to as the top-scoring pair(s) (TSP) classifier, is motivated by current technical and practical limitations in using gene expression microarray data for class prediction, for example to detect disease, identify tumors or predict treatment response. Accurate statistical inference from such data is difficult due to the small number of observations, typically tens, relative to the large number of genes, typically thousands. Moreover, conventional methods from machine learning lead to decisions which are usually very difficult to interpret in simple or biologically meaningful terms. In contrast, the TSP classifier provides decision rules which i) involve very few genes and only relative expression values (e.g., comparing the mRNA counts within a single pair of genes); ii) are both accurate and transparent; and iii) provide specific hypotheses for follow-up studies. In particular, the TSP classifier achieves prediction rates with standard cancer data that are as high as those of previous studies which use considerably more genes and complex procedures. Finally, the TSP classifier is parameter-free, thus avoiding the type of over-fitting and inflated estimates of performance that result when all aspects of learning a predictor are not properly cross-validated.
منابع مشابه
Novel Extension of k-TSP Algorithm for Microarray Classification
This paper presents a new method, referred as Weight k − TSP , which generates simple and accurate decision rules that can be widely used for classifying gene expression data. The proposed method extends previous approaches: TSP and k−TSP algorithms by considering weight pairwise mRNA comparisons and percentage changes of gene expressions in different classes. Both rankings have been modified a...
متن کاملThe Effects of Aerobic Exercise in Sprague Dawley pregnant Rats on vascular BCL-2,BAX AND eNOS Gene Expression in adult male offspring
Background: One of the most growing diseases is the onset of atherosclerosis which can be started from a fetal age. As a result, the intrauterine environment plays a role at risk of spreading this disease. Changes in vascular function considered to be as an indicator for vascular disease. Effective factors on vascular function are enzyme eNOS and apoptotic regulator factors included BCL-2 an...
متن کاملThe cucurbitacins D, E, and I from Ecballium elaterium (L.) upregulate the LC3 gene and induce cell-cycle arrest in human gastric cancer cell line AGS
Objective(s): Cucurbitacins exhibit a range of anti-cancer functions. We investigated the effects of cucurbitacins D, E, and I purified from Ecballium elaterium (L.) A. Rich fruits on some apoptotic and autophagy genes in human gastric cancer cell line AGS. Materials and Methods: Using quantitative reverse transcription PCR (qRT-PCR), the expression of LC3, VEGF, BAX, caspase-3, and c-MYC genes...
متن کاملInvestigating the CTGF mRNA Expression Level in Patients with Colorectal Cancer
Background: The Connective Tissue Growth Factor (CTGF) gene encoding an extracellular matrix (ECM)-associated protein and as a member of the CCN family of proteins plays a major role in fibrosis, inflammation and connective tissue remodeling in a variety of diseases including cancer. The CCN proteins are multifunctional and are involved in cell proliferation, adhesion and cell development durin...
متن کاملO-12: Study of Expression of DevelopmentalGenes in SCNT Cloned Embryos
(SCNT) embryos of buffaloes. 2. To study gene expression profile of important developmental genes at different stages of SCNT cloned embryo. 3. To study epigenetic reprogramming during early developments of SCNT embryos Materials and Methods: Expression analysis of developmental genes was done in different (ovarian granulose and cumulus and skin fibroblasts) donor cells; in vitro maturing oocyt...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Statistical applications in genetics and molecular biology
دوره 3 شماره
صفحات -
تاریخ انتشار 2004